The Survey on Content Addressable Storage

نویسندگان

  • Bhuvan Urgaonkar
  • Bo Zhao
چکیده

1. Introduction Content addressable storage (CAS) is an emerging mechanism that can reduce the costs associated with this volume of data by eliminating such redundancy. Essentially, CAS uses cryptographic hashing techniques to identify data by its content rather than by name. Consequently, a CAS-based system will identify sets of identical objects and only store or transmit a single copy even if higher-level logic maintains multiple copies with different names. CAS is also a data management approach. CAS uses cryptographic hashing to reduce storage requirements by exploiting commonality across multiple data objects. For example, to apply CAS to a system, we would represent each memory and disk image as a sequence of fixed-sized chunk files, where the filename of each chunk is computed using a collision-resistant cryptographic hash function. Since chunks with identical names are assumed to have identical contents, a single chunk on disk can be included in the representations of multiple memory and disk images. The simplest example of this phenomenon is that many memory and disk images contain long strings of zeros, most of which can be represented by a single disk chunk consisting of all zeros.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Content-Addressable and Associative Memory Systems a Survey

This review of content-addressable memory and as-A. Purpose of This Study sociative computer systems represents an attempt to consolidate and report nontechnically the direction in which numerous independent The primary purpose of this study is to provide in-research programs have progressed during the last ten years. The formation bases on which recommendations can be made paper reflects the v...

متن کامل

HydraFS: A High-Throughput File System for the HYDRAstor Content-Addressable Storage System

A content-addressable storage (CAS) system is a valuable tool for building storage solutions, providing efficiency by automatically detecting and eliminating duplicate blocks; it can also be capable of high throughput, at least for streaming access. However, the absence of a standardized API is a barrier to the use of CAS for existing applications. Additionally, applications would have to deal ...

متن کامل

Evaluating the Usefulness of Content Addressable Storage

Content Addressable Storage (CAS) is increasingly being used as a technique to provide for space savings when storing datasets. In this paper we analyze the performance of CAS on real-world applications and discuss the effects on space savings, savings in network bandwidth and on resultant error resilience of data. We find that a chunksize of 1 KB can provide up to 84% space savings and even hi...

متن کامل

Survey on Content Addressable Memory and Sparse Clustered Network

Most memory devices store and retrieve data by addressing specific memory locations. As a result, this path often becomes the limiting factor for systems that rely on fast memory accesses. The time required to find an item stored in memory can be reduced considerably if the item can be identified for access by its content rather than by its address. A memory that is accessed in this way is call...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007